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Highlights 

• The Pauli equation is obtained through logical inference applied to robust experiments on a charged particle. 

• The concept of spin appears as an inference resulting from the treatment of two-valued data. 

• The same reasoning yields the quantum theoretical description of neutral magnetic particles. 

• Logical inference provides a framework to establish a bridge between objective knowledge gathered through experiments 
and their description in terms of concepts. 


I. INTRODUCTION 


In laboratory experiments, one never has complete knowledge about the mechanisms that affect the outcome of the measure¬ 
ments: there is always uncertainty. In addition, the outcomes of real experiments are always subject to uncertainties with respect 
to the conditions under which the experiments are carried out. 

If there are uncertainties about the individual events and uncertainties about the conditions under which the experiment is 
carried out, it is often difficult or even impossible to establish relations between individual events. However, in the case that 
the frequencies of these events are robust (to be discussed in more detail later) it may still be possible to establish relations, not 
between the individual events, but between the frequency distributions of the observed events. 

The algebra of logical inference provides a mathematical framework that facilitates rational reasoning when there is uncer¬ 
tainty [1H5]]. A detailed discussion of the foundations of logical inference, its relation to Boolean logic and the derivation of 
its rules can be found in the papers dll and books Logical inference is the foundation for powerful tools such as 

the maximum entropy method and Bayesian analysis dy]. To the best of our knowledge, the first derivation of a non-trivial 
theoretical description by this general methodology of scientific reasoning appears in Jaynes’ papers on the relation between 
information and (quantum) statistical mechanics Ja,@]- 

A recent paper lU shows how some of the most basic equations of quantum theory, e.g. the Schrodinger equation and the 
probability distributions of pairs of particles in the singlet or triplet state emerge from the application of logical inference to 
(the abstraction of) robust experiments, without taking recourse to concepts of quantum theory. This logical-inference approach 
yields results that are unambiguous and independent of the individual subjective judgement. In addition, this approach provides 
a rational explanation for the extraordinary descriptive power of quantum theory @]. As the introduction of the concept of 
intrinsic angular momentum, called spin, is a landmark in the development of quantum theory, it is natural to ask the question 
under which circumstances this concept appears in a logical-inference treatment. 

A classical review of how the concept of spin has been introduced in quantum theory is given by van der Waerden @]. The 
original motivation to introduce this new concept was the discovery of the anomalous Zeeman effect and its transition to the 
normal Zeeman effect with increasing magnetic field (the so-called Paschen-Back effect). Pauli introduced spin in a very formal 
way by attributing to the electron an additional intrinsic magnetic quantum number taking the values ±1/2 j 10j]. Although the 
picture of the spin in terms of a “rotating electron model” was quickly and widely accepted, Pauli was strongly against this 
picture because of its purely classical-mechanics character. A few years later he suggested the Pauli equation lllll l in which 
this intrinsic degree of freedom was introduced by replacing the single-component wavefunction that appears in Schrodinger’s 
equation by a two-component wavefunction and “Pauli matrices”; the most rigorous way to establish a relation with the idea of 
the rotating electron is just a formal observation that these Pauli matrices satisfy the same commutation relation as the generators 
of the rotation group in three-dimensional space and that the two-component wavefunctions (spinors) provide a double-valued 
representation of this group & 

Bohm and his followers, in the spirit of their general approach to provide a causal interpretation of quantum mechanics, tried 
to construct a purely classical description of spin by analogy with the hydrodynamics of a rotating liquid liT2L IT 3 I 1 . Despite the 
beauty of the mathematical description, the interpretation of the spin as entity, a field, which is distributed over the whole space 
is rather exotic and can hardly be considered as a derivation and justification of the Pauli equation. 

Bohr and Pauli suggested that spin and the related magnetic moment cannot be measured in experiments which can be 
interpreted in terms of classical trajectories (such as Stern-Gerlach experiments with a free-electron beam), see Ref. 0 and 
references therein. In an inhomogeneous magnetic field, spin effects cannot be separated from the effects of the Lorentz force 
due to the orbital motion of the charged particle. However, these difficulties are technical rather than conceptual as they do 
not consider the possibility that there are neutral particles (not subject to the Lorentz force) with magnetic moments, such as 
neutrons, for which Stern-Gerlach experiment is not only possible in principle but has really been performed lfl5tl . It is clear 
now that the naive way to demonstrate the “essentially non-classical” character of the spin degree of freedom premature. 

In this paper, we show how the Pauli equation and the concept of spin naturally emerge from the logical-inference analysis of 
experiments on a charged particle. We carefully analyze the additional assumptions (some of them having obvious analogs in 
Pauli’s analysis of the anomalous Zeeman effect) which are required to pass, in a model-free way, to the Pauli equation. 
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Conceptually, we return to the roots by first introducing “spin” as some intrinsic degree of freedom characterized by a two¬ 
valued number. We will call this two-valued property “color” (e.g. red or blue) to make clear that we leave no room for 
(mislinterpretations in terms of models of rotating particle and the like. This is in sharp contrast to the interpretation of Refs. S3 
andy_3l Note that such a generalization of the concept of spin is veryjmportant in modern physics. For instance, the idea of 
isospin of elementary particles IH which was originally introduced [17] as a way to describe constituents of atomic nuclei in 
terms of the same particles (nucleons) with two subspecies (neutrons and protons). Another example is the pseudospin of the 
charge carriers in graphene [[l8i] used to indicate that the carriers belongs to sublattice A or B of the honeycomb crystal lattice. 
In both of these examples, there is nothing that is rotating! 

We further illustrate the power of the approach by an application to Stern-Gerlach experiments with chargeless, magnetic 
particles, providing additional support to the idea that quantum theory directly follows from logical inference applied to a well- 
defined class of experiments @]. 

To head off possible misunderstandings, it is important to mention that the underlying premise of our approach is that current 
scientific knowledge derives, through cognitive processes in the human brain, from the discrete events which are observed in 
laboratory experiments and from relations between those events that we, humans, discover. As a direct consequence of this 
underlying premise, the validity of the results obtained in our approach does not depend on the assumption that the observed 
events are signatures of an underlying objective reality which is mathematical in nature (for an overview of older and new work 
in this direction, see Refd). We take the point of view that the aim of physics is to provide a consistent description of relations 
between certain events that we perceive (usually with the help of some equipment) with our senses. Some of these relations 
express cause followed by an effect and others do not. A derivation of a quantum theoretical description from logical-inference 
principles does not prohibit the construction of cause-and-effect mechanisms that, when analyzed in the same manner as in 
real experiments, create the impression that the system behaves according to quantum theory I B0l - t22|] . Work in this direction 
has shown that it is indeed possible to build simulation models which reproduce, on an event-by-event basis, the results of 
interference/entanglement/uncertainty experiments with photons/neutrons |23h27[1 . 

The paper is organized as follows. In Section[n]we specify the measurement scenario, introduce the inference-probability that 
characterizes the observed detection events (all the elements of logical inference that are required to for the purpose of the present 
paper are summarized in Appendix[A}. Then, we discuss and formalize the notion of a robust experiment. Although these three 
steps are similar to the ones taken in the logical-inference derivation of the Schrodinger equation Jsl, to make the presentation 
self-contained, we give a detailed account. The next three subsections address the problem of including additional knowledge 
about the motion of the particle in some limiting cases. In subsection III HI we collect the results of the previous subsections and 
derive the Pauli equation. Section [Til] shows that the same procedure leads to the quantum theoretical equation that describes 
the motion of an uncharged particle in a magnetic field. A discussion of the relation of the logical-inference derivation of the 
Pauli equation and earlier work on the hydrodynamic formulation of quantum theory is given in Section [TV] A summary and 
discussion of more general aspects of the work presented in this paper can be found in SectionlVl 


II. LOGICAL INFERENCE: DERIVATION OF THE PAULI EQUATION 
A. Measurement scenario 


We consider N repetitions of an experiment on a particle located in 3-dimensional space Cl. The experiment consists of 
sending a signal to the particle at discrete times labeled by the integer x = 1,... ,M. It is assumed that for each repetition, labeled 
by n = 1 the particle is at the unknown position X T £ Cl. As the particle receives the signal, it responds by emitting 

another signal which is recorded by an array of detectors. For each signal emitted by a particle the data recorded by the detector 
system is used to determine the position j„ T £ "V where "V denotes the set of voxels with linear extent [—A, A] /2 that cover 
the 3-dimensional space Cl. The signal also contains additional information which is two-valued and encodes, so to speak, the 
“color” of the particle at the time when it responded to the signal emitted by the source. This color is represented by variables 
k n t = ±1. The frequency distribution of the (j,k)„ T ’s changes with the applied electric and magnetic field from which we may 
infer that there is some form of interaction between the electromagnetic field and the particle. 

The result of N repetitions of the experiment yields the data set 

k=±l;n=l,...,N-, T = 1.Af}, (1) 


or, denoting the total counts of voxels j and color k at time X by 0 < c,- * T < N , the data set can be represented as 

^ = k x = l,... ,M ; 52 


i=±t je[-L d ,L d ] 


( 2 ) 
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B. Inference-probability of the data produced by the experiment 

The first step is to introduce a real number 0 < P(j,k\X T ,r,Z) < 1 which represents the plausibility that we observe a detector 
click (j,k), conditional on (X t ,t,Z). For reasons explained in Appendix iBl P(j,k\X T ,x,Z) is called inference-probability (or 
i-prob for short) and encodes the relation between the unknown location X T and the location j and color k registered by the 
detector system at discrete time T. Except for the unknown location X T , all other experimental conditions are represented by Z 
and are assumed to be fixed and identical for all experiments. Note that unlike in the case of parameter estimation, in the case at 
hand both P( j,k|X T , T,Z) and the parameters X T are unknown. 

We make the following, seemingly reasonable assumptions: 

1. Each repetition of the experiment represents an event of which the outcome is logically independent of any other such 
event. By application of the product rule (see AppendixlBli. a direct consequence of this assumption is that 

M N 

P(J\X u ...,X Ml N,Z) = n n^MXr,T,Z), (3) 

T=1 n =1 

and hence 

p( 9 \x u ...,x u ,N,z)=mf[Yl n f(j ^l XT,T ; Z)CjM - ( 4 ) 

T=lieYk=±l c 3 ,k,r- 

2. It is assumed that it does not matter where the experiment is carried out. This implies that the i-prob should have the 
property 


P(j,k|X T ,T,Z)=P(j + £,k|X T + C,T,Z), 


( 5 ) 


where £ is an arbitrary 3-dimensional vector. The relation Eq. (0 expresses the assumption that space is homogeneous. 


C. Condition for reproducibility and robustness 


If the frequencies with which the detectors fire vary erratically with {X T }, the experiment would most likely be called “irre- 
producible”. Excluding such experiments, it is desirable that frequency distributions of the data exhibit some kind of robustness, 
smoothness with respect to small changes of the unknown values of {X T }. Unless the experimental setup is sufficiently “robust” 
in the sense just explained, repeating the run with slightly different values of {X T } would often produce results that are very 
different from those of other runs and it is common practice to discard such experimental data. Therefore, a “good” experiment 
must be a robust experiment. 

The robustness with respect to small variations of the conditions under which the experiment is carried out should be reflected 
in the expression of the i-prob to observe data sets which yield reproducible averages and correlations (with the usual statistical 
fluctuations). The next step therefore is to determine the expression for P(j,k\X t , x ,Z) which is most insensitive to small changes 
in X T . It is expedient to formulate this problem as an hypothesis test. Let Hq and H\ be the hypothesis that the same data 3> is 
observed for the unknown locations {X T } and {X T + E r }, respectively. The evidence Ev of hypothesis H\, relative to hypothesis 
Hq, is defined by Hll 


Ev = In 


P(2>\X T + e T ,x,N,Z) 


— ^ c j4,rl n 
j ,*,T 


P(3>\X T ,x,N,Z) 
P(j,k\Xr + 


®Tj LZ) 


( 6 ) 


P(j,k\X t ,x,Z) ’ 

where the logarithm serves to facilitate algebraic manipulations. If H\ is more (less) plausible than Hq then Ev > 0 (Ev < 0). 
In statistics, the r.h.s. of Eq. 0 is known as the log-likelihood function and used for parameter estimation. In contrast, in the 
present context, the function Eq. 0 is not used to estimate X T but is a vehicle to express the robustness with respect to the 
coordinates X T . 

Writing Eq. 0 as a Taylor series in £ we have 


Ev= E c jArl n 

j ,k,t 


1 + 


£ T • V T F(j,k|X T , t,Z) 1 (£t-V t ) P(j,k\X T ,x,Z) 


P(j,k\X z ,X,Z) 


■ + 


P(j,k\X r ,x,Z) 


+ ^(e?) 


£t ' V T P(j, k\X T , T,Z) 

1 

£ t • V T P(j,k|X T , T,Z) 

2 , 1 (£ T -V T ) 2 P(j,k|X T ,T,Z)' 

P{j,k\X T ,x,Z) 

2 

L nU|Xr,T,Z) J 

-1 

la 

u* 

Ico 

- 




( 7 ) 


where V T differentiates with respect to X r . Here and in the following we assume that e T is sufficiently small such that the third 
and higher order terms in the £’s can be ignored. According to our criterion of robustness, the evidence Eq. 0 should change as 
















5 


little as possible as X T varies. This can be accomplished by minimizing, in absolute value, all the coefficients of the polynomial 
in £ r ,/or all allowed £ T and X T . The clause “for all allowed £ z and X T ” implies that we are dealing here with an instance of a 
global optimization problem [ 281 ]. 

The first and third sum in Eq. © vanish identically if we choose c^ z /N = P(\.k\X T , T.Z). Indeed, we have 

^=iv£(e T .V T )V(j,fc|X T) T,z) 

j,*,T 

= A£(£ T -V T )“£p(j,A:|X T ,T,z) 

T i,k 


E c iY 

j.M 


(e T -V T ) g P(j,*|X T , 

P(j,k\X r ,T,Z) 


= a£(£ t -V t )“ 1=0, 


( 8 ) 


for a = 1,2,.... Although this choice is motivated by the desire to eliminate contributions of order £ T , it follows that our 
criterion of robustness automatically suggests the intuitively obvious procedure to assign to P(j,k\X T , T,Z) the value of the 
observed frequencies of occurrences Jill. 

Dropping irrelevant numerical factors and terms of ffitl), the remaining contribution to the evidence 


1 


Ev = V - 

j5^ p (j^i x t’ T ’ z ) 


[£-V T P(j,^|X T ,T,Z)] 2 , 


(9) 


vanishes identically (for all £ T ) if and only if V r _P(j,£|X T , T,Z) = 0 in which case it is clear that we can only describe experiments 
for which the data does not exhibit any dependence on X T . 

Experiments which produce frequency distributions that do not depend on the conditions do not increase our knowl¬ 
edge about the relation between the conditions and the observed data. Therefore, we explicitly exclude such non- 
informative experiments. 

Thus, from now on, we explicitly exclude the class of experiments for which V T _P(j,fc|X x , T,Z) = 0. 

The clause “for all allowed £ x ” can be eliminated using the Cauchy-Schwarz inequality. We have 


Ev = E 

j,M 


£ x • V T P(j,k|X T , T,Z) 


-i 2 


< e z 


1 


■[V x P(j,^|X T ,T,Z)] ; 


( 10 ) 


P l / 2 (j,k\X T ,T,Z) \ ^ r P(j,k\X T ,t,Z) 

where £ 2 = max T £~. As the £ r ’s are arbitrary (but small), it follows from Eq. (ITOl) that we find the robust solution(s) by searching 
for the global minimum of 


j _ y _i_ 

F fc t P(i,k\X t ,T,Z) 


^rP{jik\X Tl T,Z) 


(ID 


which is the Fisher information of the measurement scenario described above. 


D. Continuum limit 


Propositions such as “detector (]. : k) has clicked at time t” are ultimately related to sensory experience and are therefore dis¬ 
crete in nature. On the other hand, the basic equations of quantum theory such as the Schrodinger, Pauli and Dirac equations 
are formulated in continuum space. Taking the continuum limit of the discrete formulation connects the two modes of descrip¬ 
tion. Here and in the following, we use the symbols for (partial) derivatives for both the case that the continuum approximation 
is meaningful and the case that it is not. In the latter, operator symbols such as d/dt should be read as the corresponding 
finite-difference operators. 

Assuming that the continuum limit is well-defined, we have V —> Q and the Fisher information reads 


If = dxdt^ ^ 

j=u=± 

/‘ 

1 


1 


=±l P(x,k\X,t,Z) _ 

I d * d %JiiP(x, k \x, t ,z) . 



dxdt Y" 1 —--;-- 

kh±\ P{x,k\X,t,Z) 


VP(x,k|X,f,Z)] 2 


( 12 ) 


where V denotes derivatives with respect to x and we have simplified the notation somewhat by writing X = X t . We have 
changed derivatives with respect to X to derivatives with respect to x by assuming that (P(x.k\X.t.Z) = P(x + y,k|X + y,f,Z) 
holds for all y (see assumption 2 in Section IUbT ). Furthermore, it is understood that integrations are over the domain defined 
by the measurement scenario. Technically speaking, after passing to the continuum limit, P(x\X.t.Z) denotes the probability 
density, not the probability itself. However, as mentioned above, we write integration and derivation symbols for both the discrete 
case and its continuum limit and as there can be no confusion about which case we are considering, we use the same symbol for 
the probability density and the probability. 
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For later use, it is expedient to write Eq. (fl2l > in a different form which separates the data about the position of the clicks and 
the associated color k as much as possible. According to the product rule, we have 

P(x,k\X,t,Z) = P{k\x,X,t,Z)P(x\X,t,Z), (13) 

which we may, without loss of generality, represent as 

P(x,k = +1|X, t,Z) = F(x|X,f,Z)cos 2 ^ 1 —- 

P(x,k = — l|X,f,Z) = F(x|X,t,Z)sin 2 (14) 

Substituting Eq. (IT4l > into Eq. ( 1 1 21 ) we obtain 


/F = J dxdt | - 


1 


[VP(x|X,f,Z)] 2 + [V0(x,X,f ,Z)] 2 F(x|X,f ,Z) 


.P(x|X,t,Z) L " - ’ (15) 

which is the Fisher information for the measurement scenario described earlier. Note that up to this point, we have not assumed 
that the particle moves or carries a magnetic moment nor did we assign any particular meaning to 9{x,X,t,Z). 

According to the principle laid out earlier, our task is to search for the global minimum of Eq. 03, the Fisher information of 
the measurement scenario described above, thereby excluding the uninformative class of solutions. 


E. Including knowledge 


It is instructive to first search for the global minimum of Eq. ( fl5] > in the case that we do not know whether the particle moves 
or not and do not know about the effect of the applied electromagnetic field on the frequency distribution of the (j,k)„ T ’s. In 
this situation, we may discard the time dependence altogether and search for the non-trivial global minimum of 


I F = 


/*{; 


1 


■ [VP(x|X,Z)]- + [V0(x,X,Z)] 2 P(x|X,Z; 


(16) 


.P(x|X,Z) 

For pedagogical purposes, we now specialize to the case of one spatial dimension and discard the color dependence, that is 
we set VO(x.X.Z) = 0 and assume that Q [0,L] where [0. L\ is the range covered by the detection system. With the latter 
assumption P(x|X,Z) = 0 for x < 0 or x > L. 

Recalling the assumption that space is homogeneous (see Eq. Q), we search for solutions of the form P(x\X .Z) = f(x-X.Z). 
As /(x —X,Z) > 0, we may substitute P{x\X,Z) = /(x —X,Z) = y/ 2 (x — X,Z) in Eq. (IT6l) and we obtain 

(17) 


dx J 

Recall that the requirement of a global minimum entails that Ip is constant, independent of the unknown position X of the 
particle. 

The extrema of Eq. (fTTI) are easily found by a standard variational calculation. Introducing the Lagrange multiplier ll to 
account for the constraint fg dxy/ 2 (x — X,Z) = fg dxP{x\X ,Z) = 1, the extrema are the solutions of 


d 2 w(x-X,Z) u , 


(18) 


For fj. > 0, the solutions of Eq. (fl8l > are hyperbolic functions, a family of solutions that is not compatible with the constraint 
P{x\X,Z) = 0 for x = 0,L and can therefore be ruled out. Writing u = -4v 2 , the general solution of Eq. JTsl i reads 
ys(x — X,Z) = ci(Z)sin v(x —X) +c 2 (Z)cos v(x — X) 

= [ci (Z) cos vX + C 2 (Z) sin vX] sin vx— [c\ (Z) sin vX — C 2 (Z)cos vX]cos vx, (19) 

where c\(Z) and C 2 (Z) are integration constants. Imposing the boundary condition y/(x — X.Z) = 0 for x = 0 we must have 
ci (Z) sin vX = C 2 (Z) cos vX hence the second term in Eq. (1 1 9| ) vanishes for all x. In addition, imposing the boundary condition 
t/r(x —X,Z) = 0 for x = L, we must have either ci(Z)cos vX + C 2 (Z)sinvX = 0 in which case i j/(x — X,Z) = 0 for all x or 
V = rnt/L for n = 1,2, ... in which case the non-trivial solutions read 


t jf(x-X,Z) = 


ci (Z)cos 


nnX 


-C 2 (Z)sin 


llTZX 


Using ci (Z) sin vX = C 2 (Z)cos vX with v =n7i/L we find that 

i/r(x —X,Z) = [c 2 (Z)+c5(Z)]sin 2 


nnx 


n = 1,2, 


nnx 

~L 


n = 1,2, 


( 20 ) 


( 21 ) 


and from fg dx y/~(x — X,Z) = 1 we find that L[c 2 (Z) + c2(Z)]/2 = 1. Hence 


, , , 2 nnx 

P(x|X,Z) = -sm 2 — 


«= 1 , 2 ,..., ( 22 ) 

which are nothing but the solutions of the Schrodinger equation of a free particle in a one-dimensional box tf29ll . Note that the 
r.h.s of Eq. ( l22l) does not depend on X. In other words, from the measured data we cannot infer anything about the unknown 
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position X, in concert with the notion that the particle is “free”. From Eq. (l20l > it follows that Ip = (2n7i/L) 2 , independent of X 
as it should be. Clearly, the solution for non-trivial global minimum of If is given by Eq. (l22l> with n = 1. 

Returning to the case that the frequency distribution of the (j,£)„ x ’s indicates that the motion of the particle depends on 
the applied electric or magnetic field, we can incorporate this additional knowledge as a constraint on the global minimization 
problem. In general, the global minimization problems that we will consider take the form A If + A where A is a parameter (not 
a Lagrange multiplier) that “weights” the uncertainty in the conditions (represented by If) relative to the knowledge represented 
by the functional 

A= f dxdt F(x,k,t,Z)P(x,k\X,t,Z), (23) 

k=±\ 


where F{x,k,t,Z) is a function which encodes the additional knowledge and which does not depend on the unknown position X. 

The assumption that space is homogeneous allows us to replace derivatives with respect to X by derivatives with respect to x. 
This helps in searching for the global minimum of XIf + A because it can be found by searching for the extrema of 


Xlf +A= / dxdt ^ 

k=± l 


P(x,k\X,t ,Z) 


[VP(x 1 k\X,t : Z)} 2 +F{x 1 k,t,Z)P(x,k\X,t,Z) 


as a functional of the P(x,k\X,t ,Z)’s. By the standard variational procedure, the extrema of Xlf + A are the solutions of 


A[VP(x,/t|X,f,Z)]" 
P 2 (x,k|X,f ,Z) 


+ 2AV 


V.P(x,k|X,f,Z) 


-F(x,k,t,Z) = 0 


k = 1,2, 


(24) 


(25) 


L P(x,k\X,t,Z) J 

On the other hand, the global minimum of A If + A should not depend on unknown X because if it did, it was not a global 
minimum and in addition, the values of A If + A would tell us something about X, a contradiction to the assumption that X is 
unknown. 

Taking the derivative of Eq. (l24l > with respect to X (recall X = X f ) yields 

V t P(x,k\X,t,Z). (26) 

j L L j 

Comparing Eqs. ( |25| > and (l26l > and recalling the constraint V T / J (j A|X T . A.Z) ^ 0 used to eliminate uninformative solutions, we 
conclude that the extrema (and therefore also the global minimum) of Eq. (l24l > are (is) independent of X f , as required. 


[dxdt V 

U v ^a|x,,,z)Z A7 

VP(x,k X,f,Z) 

1 

N 

1 

£=±1 

P-(x,k\X,t,Z) 

L P(x,k\X,t,Z) \ 



F. Motion of the particle 


We consider the limiting case that there is no uncertainty on the position of the particle, that is x = X for all clicks. Then the 
motion of the particle and the motion of the positions of the detector clicks map one-to-one, for each repetition of the experiment 
(by assumption). 

From the data x(f) we can compute the vector field U(x,f) defined by 

f = U(x,). (27) 

In principle, U(x,f) is fully determined by the data obtained by repeating the experiment under different (initial) conditions. In 
practice, however, it is unlikely that we have enough data to compute U(x,t) for all (x,f). 

We only consider the case in which the position of the clicks is encoded by its (x, y,z)-coordinates in an orthogonal frame 
of reference attached to the observer. Under the usual assumptions of differentiabilty etc., we can use the Helmholtz-like 
decomposition of a vector field U(x,f) = VS(x,t) — V x W(x,f). We will not use this form but write [|3ll] 

U(x,0 = VS(x,/)-A(x,f), ( 28 ) 

where S(x,t ) is a scalar function and A(x,f) a vector field. Clearly Eq. (f28t has some extra freedom which we can remove by 
requiring that A(x,f) = V x W(x,f). This amounts to requiring that V ■ A = 0. It is convenient not do this at this stage so we take 
Eq. ( |28| > and will impose V • A = 0 later. As mentioned earlier, if differentiabilty is an issue we should use the finite-difference 
form of the V operators. 

For convenience, we drop the (x.t ) arguments and switch to a component-wise notation in the few paragraphs that follow. 
From Eq. (l27l > and Eq. (l28l > it directly follows that l(30h 


d 2 Xi dUj -ydUi 

dt 2 r)t ^ r>Y- 2 


dt 

d 2 S 

dx:dt 


= 1 dx J 
dAi 
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3 
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d 2 S dA, 


dxidxj dx. 


dS 

dx 


— A 


d_ 

dx{ 


dS 1 


dt+ 2^\dxj Aj 


dS 


3 

E 

i= i 


dA i _ 

dxt dx, 


dAj \ ( dS 


dx 


■ —Aj - 


dAi 

dt 


(29) 


where i= 1,2,3 labels the coordinate of the detector clicks. 
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Introducing the vector field B = V x A the second term in Eq. (129} can we written as 


dAj dAj 


dS 


£ ( dx ■ dx; i ( A r . A n ( At xB 


dx: 


( dx 


\ dt 


(30) 


It is important to note that in order to derive Eq. (|3Q} . it is essential that the position is represented by three coordinates. Switching 
back to the vector notation we have 

d 2 x 


dt 2 


= V 


dS 1 .2 

* + 2 (Vi - A) 


dx dA 

— x B —=—. 
dt dt 


(31) 


Up to now, we have not made any assumption other than that space is three-dimensional. Next comes a crucial step in the 
reasoning. Let us hypothesize that there exists a scalar field <j> = @(x.t) such that 

^ + i(VS — A) 2 = -0. (32) 

Then, upon introducing the vector field E = — V0 — dA/dt, Eq. (OTt becomes 

d 2 x ^ dx 

^“ E + ^ XB - (33) 


Although Eq. d33} has the same the structure as the equation of motion of a charged particle in an electromagnetic field (E.B), 
our derivation of Eq. (133} is solely based on the elementary observation that the data yields the vector field U(x, t ) (see Eq. (128} ). 
some standard vector-field identities and the hypothesis that there exist a scalar field (j) such that Eq. ( 132} holds. No reference 
to charged particles or electromagnetic fields enters the derivation. Put differently (and putting aside technicalities related to 
differentiability), if there exist a scalar field 0 such that Eq. (132} holds, then mathematics alone dictates that the equation of 
motion must have the structure Eq. (133} . with E and B having no relation to the electromagnetic field acting on a charged 
particle. The latter relation is established when the data shows that there is indeed an effect of electromagnetic field on the 
motion of the particle, an effect from which it is inferred that the particle carries charge. This relation can be made explicit by 
introducing the symbols m for the mass and q for the charge of the particle and by replacing A by qA/m (we work with MKS 
units throughout this paper) and (j) by (q(f> + u)/m where u represent all potentials that are not of electromagnetic origin. Then 
we have 


d~x 

l Hf- 


dx 

qE+q 7it xB ’ 


and upon replacing S by S/m and V = q(j) 


dS 1 .j 

37 + — {VS — qA) 
dt Zm 


+ V = 0. 


(34) 


(35) 


Note that we have obtained the Hamilton-Iacobi equation Eq. (135} without making any reference to a Hamiltonian, the action, 
contact transformations and the like. In essence, Eqs. (f28l) — (f35l) follow from Eq. (1 27 h . some mathematical identities and the 
crucial assumption that there exist a V such that Eq. (135} holds. Summarizing: 


If we can find scalar fields S and V and a vector field A(x,f) such that Eq. (I35b holds for all (x,f) then the clicks of 
the detectors will carve out a trajectory that is completely determined by the classical equation of motion Eq. ([34} 
of a particle in a potential and subject to electromagnetic potentials. 


Of course, there is nothing really new in this statement: it is just telling us what we know from classical mechanics but there is 
a slight twist. 

First, given the data x(f) of the detector clicks, this data will not comply with the equations of classical mechanics unless we 
can find scalar fields S (the action) and V (the potential) and a vector field A(x,f) (vector potential) such that Eq. (135} holds. 
Second, in the case of interest to us here, there is uncertainty on the mapping between the particle position X(f) and the position 
of the corresponding clicks x(t ) and there is no reason to expect that Eq. (135} will hold. Instead of requiring that Eq. (135} holds, 
we will require that there exists two scalar fields V k (x,t) for k = ±1 such that 


/ 


dxdt ^ 
*=± 1 


dS k (x,t) 
dt 


-^{VS k (x,t)-qA{x,t)f +V k (x,t) 


P{x,k\X,t,Z) =0, 


(36) 


where we regard the particles that respond with k = +1 or k = — 1 as two different objects, the clicks generated by each object 
being described by its own Hamilton-Jacobi equation with potentials \4(x,t). 

The next step is to disentangle as much as possible the motion of the positions of the clicks from their k-values. We introduce 
S k (x,t) = 5(x,f) — kR(x,t) for k = ±1 and after some rearrangements we obtain 


A = / dxdt ^ 

jfc=±t 

J dxdt | 
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dS k (x,t) 


dt 


1 

2 m 


(VS k (x,t)-qA(x 1 t)) 2 + V k (x,t) 


P(x,k\X,t,Z) 


= / dxdt / — 
Zm L 


(VS(x,f) — qA(x,t)) 
dS(x,t) 


dt 


■ — cos0(x,X,t,Z) 


VR(x,t )) — 2cos 9(x,X,t,Z)VR(x,t) (V5(x,f) — qA(x,t)) 
dR(x,t) 


dt 


-f-Vo(x,f)+Vj(x,f)cos0(x,X,f,Z) \P(x\X,t,Z), 


(37) 
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where Vo(x,f) = [V+i(x,f) + VLi(x,f)]/2, Vi(x,t) = [V+i(x,f) — V_i(x,f)]/2 and we made use of £j. =±1 kP(x,k\X,t ,Z) = 
cos6(x,X,t,Z)P(x\X,t,Z). Omitting the terms involving cos 9(x,X,t,Z) and R(x,t ), Eq. 071) reduces to the expression of 
the averaged Hamilton-Jacobi equation which entered the derivation of the time-dependent Schrodinger equation |[H]. 


G. Including the motion of the magnetic moment 


The function cos 0(x,X,f ,Z) determines the ratio of k = ±1 clicks and R(x,t) = (S_i(x,f) — 5+i(x,f))/2, that is half of the 
difference between the actions of the k = — 1 and k = +1 clicks. We can relate these two functions to the direction of a classical 
magnetic moment by imposing the constraint that when the positions of the clicks (=particle position in this case) do not move, 
we recover the classical-mechanical equation of motion of a magnetic moment in a magnetic field, for every x. 

In the limit that m —> °° (corresponding to the situation that the positions of the clicks hardly change with time) we have 


lim A = 

m—>°o 


1 


dx dt 


dsM -cose(*,x,,x> dR(xJ) 


+ Vo(x,f) + Vj(x,f)cos0(x,X,f,Z) >P{x\X,t,Z). 


(38) 


dt v ’ ’ ’ ' dt 

Without loss of generality, we may assume that Vo(x,f) = Vo(x,t) + Vo(x,f) where Vo(x,t) does not depend on 0(x,X,f,Z) and 
R(x,t) while Vo(x,t) may. Writing V\ (x,f) = Vo(x,r) + V\ (x,f)cos 0(x,X,f,Z), searching for the extrema of Eq. (l38l > through 
variation with respect to cos 0(x,X,f ,Z), R(x,t), S(x,t ) and P{x,t ) yields 


dR(x,t) 

dt 

<9 cos 6(x,X,t,Z) 

di 

dP(x\X 1 t,Z) 
- dt - 

dS(x,t ) 
dt 


dV\(x,t) 


<9cos0(x,X,f,Z) 

5Ei(x,t) 


dR(x,t) 


S{x,t 


(39) 


(40) 


(41) 

(42) 

From Eq. (l4Tb it follows that P(x\X,t,Z) does not change with time, in concert with the assumption that the positions of the clicks 
are stationary. Comparing Eqs. (|39| > and (l40l > with Eq. ( lC7| i. it is clear that we will recover the classical equations of motion 
of the magnetic moment if (i) we set Vi(x,f) = ym(x.r) -B(x,f) where m(x,f) is a unit vector, and (ii) make the symbolic 
identification z = cos0(x,X,f,Z) and <p(x,t) = R{x 1 t)/a where a needs to be introduced to give a(p(x,t) the dimension of 
S(x, t). Substituting the infered expression for V] (x.f ) in Eq. (1371) yields 


= 0 


= cos 0(x,X,f,Z) 


dR(x,t) 
dt 


-V 0 (x,t)-V i(x,f). 


A = 
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dxdt { — 

( 2m . 


(VS(x,f) — qA(x,t)) 2 +a 2 (Vcp(x,t))~ — 2acos9(x,X,t,Z)V<p(x,t) (VS(x,t ) —qA(x,t)) 


dS(x,t) 


dt 


— a cos 0(x,X,f ,Z) 


d(p{x,t) 


dt 


+ Vo(x,t) — aym(x,f)-B(x,f) >P(x|X,f,Z). 


(43) 


H. Derivation of the Pauli equation 


We now have all ingredients to derive the Pauli equation from the principle that logical inference applied to the most robust 
experiment yields a quantum theoretical description y|]. According to this principle, we should search for the global minimum of 
the Fisher information for the experiment, subject to the condition that when the uncertainty vanishes, we recover the equations 
of motion of classical mechanics @|. Thus, we should search for the global minimum of 


F = XI f + A, 


(44) 


where Ip and A are given by Eqs. (IT5l) and (l43l >. respectively. 

In AppcndixfUl it is shown that the quadratic functional Q which yields the Pauli equation is identical to Eq. (l44l) if we make 
the identification Vo(x,f) = q(j)(x,t), a = Ti/2, y = q/m and A = h 2 /8m and 


f P l / 2 (x,k = +l\Xd,Z)e iS ^ x ’‘y h \ 

\P l l 2 (x 1 k = -\\X 1 t 1 Z)e iS ^l h J ' 


( 45 ) 


This then completes the derivation of the Pauli equation from logical inference principles. 
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I. Discussion 

In Section HT fI we showed how to include the knowledge that in the absence of uncertainty the particle’s motion is described 
by Newtonian mechanics. Obviously, this treatment requires the particle to have a nonzero mass. On the other hand, in our 
logical inference treatment of the free particle in Section II.E, the notion of mass does not enter in the derivation of Eq. (l22l) 
but neither does the concept of motion. This raises the interesting question how to inject into the logical inference treatment the 
notion of moving massless particles with spin. We believe that the analogy with the pseudo-spin in graphene mentioned in the 
introduction may provide a fruitful route to explore this issue. 

The carbon atoms of ideal single-layer graphene form a hexagonal lattice with the 7T-band (originating from p.-orbitals of 
carbon atoms) well separated from other bands lfl8tl . The electronic band structure of graphene has the remarkable feature 
that in the continuum limit, the low-energy excitations are described by the two-dimensional Dirac equation for two species of 
massless fermions (corresponding to two valleys, K and K'). The fact that there the wave function of each of these two species 
is a two-component “spinor” is not related to the intrinsic spin of the electron but is a manifestation of the two sub-lattice and 
bipartite structure of the hexagonal lattice El- This feature (Dirac-like spectrum) is present already in the simplest model where 
only the nearest-neighbor hopping is taken into account [3l]] but, actually, it is robust and follows just from discrete symmetries, 
namely, time-reversal and inversion symmetries |18). A generalization to a four dimensional lattice, retaining the property that 
the continuum limit yields the Dirac equation, is given in Ref. |32|. This is a nice illustration of the fact that the model of a 
rotating electron is not the only way to arrive at the concept of spin. In our derivation of the Pauli equation, we have to make 
the additional assumption (based on experimental observations such as the anomalous Zeeman effect) that the interaction of this 
intrinsic degree of freedom with an external magnetic field is described by the standard classical expression for the energy of a 
magnetic moment. 

The next important step might be the derivation of the Dirac equation. The Creutz model j32}l suggests that we should 
consider incorporating into the logical inference treatment, the additional knowledge that one has objects hopping on a lattice 
instead of particles moving in a space-time continuum. Recall that up to Section II.D, the description of the measurement 
scenario, robustness etc. is explicitly discrete. In Section UTDl the continuum limit was taken only because our aim was to derive 
the Pauli equation, which is formulated in continuum space-time. Of course, the description of the motion of the particle in 
Section UTFl is entirely within a continuum description but there is no fundamental obstacle to replace this treatment by a proper 
treatment of objects hopping on a lattice. Therefore it seems plausible that the logical inference approach can be extended to 
describe massless spin-1/2 particles moving in continuum space-time by considering the continuum limit of the corresponding 
lattice model. An in-depth, general treatment of this problem is beyond the scope of the present paper and we therefore leave 
this interesting problem for future research. 

A comment on the appearance of h is in order. First of all, it should be noted that recent work has shown that h may be 
eliminated from the basic equations of (low-energy) physics by a re-definition of the units of mass, time, etc. lf33l [34ll . This 
is also clear from the way Ti appears in the identification that we used to shown that quadratic functional Q which yields the 
Pauli equation (see Eq. (1B4I) ) is the same as Eq. (l44l) . With the MKS units adopted in the present paper, Planck’s constant h 
enters because of dimensional reasons (a = h/2 ) and also controls the importance of the term that expresses the robustness of 
the experimental procedure (A = 7r/8m). The actual value of A can only be determined by laboratory experiments. Note that 
the logical-inference derivation of the canonical ensemble of statistical mechanics HH employs the same reasoning to relate 
the inverse temperature j3 = 1 /ksT to the average thermal energy. 

We end this section by addressing a technicality. Mappings such as Eq. (l45l ) are not one-to-one. This is clear: we can alway add 
a multiple of 2nTi to 5j (x,f) or .S '2 (x, r), for instance. In the hydrodynamic form of the Schrodinger equation [35], the ambiguity 
that ensues has implications for the interpretation of the gradient of action as a velocity field 13hi [3711 . As pointed out by 
Novikov, similar ambiguities appear in classical mechanics proper if the local equations of motion (Hamilton equations) are not 
sufficient to characterize the system completely and the global structure of the phase space has to be taken into consideration |f38j l. 
However, for the present purpose, this ambiguity has no effect on the minimization of F because Eq. ( 144| i does not change if 
we add to 5j (x,f) or .S - i(x,f) a real number which does not depend on (x,f) (as is evident from Eq. (TTTb ) or, equivalently, if we 
multiply <E(x|X,f,Z) by a global phase factor and add a constant to <p(x,f). 


III. STERN-GERLACH EXPERIMENT: NEUTRAL MAGNETIC PARTICLE 

The Stern-Gerlach experiment with silver atoms lf39h and neutrons S demonstrates that a magnetic field affects the motion 
of a neutral particle suggesting that minimalist theoretical description should account for the interaction of the magnetic moment 
of the particle and the applied magnetic field. As is clear from the definition of the Pauli Hamiltonian Eq. (1B2I) . in the Pauli 
equation the magnetic field is directly linked to the charge q of the particle. Therefore, in this form the Pauli equation cannot be 
used to describe the motion of a neutral magnetic particle in a magnetic field. 

In quantum theory, this problem is solved by the ad-hoc introduction of the intrinsic magnetic moment which is proportional 
to the spin and by replacing qh/2m by the gyromagnetic ratio y, the value of which is particle-specific. 
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In the logical-inference treatment, no such ad-hoc procedure is necessary. We simply set q = 0 in Eq. (l43l) and use Eq. ( 1451 ) to 
find the equivalent quadratic form. The Hamiltonian that appears in this quadratic form reads 


H = 



yff-B (x,o, 


(46) 


where y is the gyromagnetic ratio which, in general, is not given by q/m. As mentioned earlier, the appearance in Eq. ( l46l > of 
the Pauli-matrices is a direct consequence of logical inference applied to robust experiments that yield data in the form of the 
position and one of the two kinds of detector clicks. 


IV. RELATION TO EARLIER WORK 

Readers familiar with the hydrodynamic formulation of quantum theory [ 35) and its interpretation in terms of Bohmian me¬ 
chanics l40l Pilll undoubtedly recognize the steps which transform quadratic functional Eq. (IB4b yielding the Pauli equation 
Eq. (IB2I) and the functional Q given by Eq. ( 1B27I ). In fact, the functional Q has been used as the starting point for the hydrody¬ 
namic representation |42] and a causal interpretation iTl^L !43l|44}| of the Pauli equation. In this formulation, the two-component 
spinor can be given a classical-mechanics interpretation in terms of an assembly of very small rotating bodies which are dis¬ 
tributed continuously in space. Within this interpretations spins of different bodies interact. 

Clearly, the logical-inference treatment does not support this interpretation: the functional Eq. ( IB27b is the result of analyzing a 
robust experiment that yields data in the form of (x, k) where x is a 3-dimensional coordinate and k = ± 1 denotes the two-valued 
“color”, together with the requirement that on average and in special cases, the data should comply with the classical-mechanical 
motion. 

An expression of Eq. ( IB27b in which the separation of the contribution of the Fisher information and the classical-field 
mechanical is explicit has been given by Reginatto |[45il . This expression is different from ours. Comparing Eq. (IT5l) with 
Eq. (6,7) in Ref.l45L we find that the expressions are fundamentally different due the fact that the representation (7), when 
substituted in (6), does not yield Eq. ( lB27b . 


V. CONCLUSION 

It is somewhat discomforting that it takes a considerable amount of symbolic manipulations to derive the Pauli equation from 
the combination of the measurement scenario, the notion of a robust experiment and the behavior expected in some limiting 
cases. Therefore, it may be worthwhile to recapitulate what has be done in simple words, without worrying too much about the 
technicalities. 

The first step is to describe the measurement scenario. It is assumed that the object (particle) we are interested in responds to 
the signal that we send to probe it. The response of the object triggers a detection event. In the case at hand, the data representing 
the detector clicks consist of spatial coordinates and two-valued “color” indices. We assign an i-prob to the whole data set. To 
make progress, it is necessary to make assumptions about the data-collection procedure. We assume that each time we probe 
the object, the data produced by the detection system is logically independent of all other data produced by previous/subsequent 
probing. With this assumption, together with the assumption that is does not matter where we carry out the experiment, the 
notion of a robust experiment is found to be equivalent to the global minimum of the Fisher information for the corresponding 
measurement scenario (see Eq. (IT5l) ). 

The next step is to bring in the knowledge that in the extreme case that there is no uncertainty about the outcome of each 
detection event, we expect to observe data that is compliant with classical, Newtonian mechanics both for the motion of a 
particle as well as for the motion of its magnetic moment in the case that the particle does not move (see Eq. d43t). 

The third step is to find the balance between the uncertainty in the detection events represented by Eq. (IT5l) and the “classical 
mechanics” knowledge represented by Eq. (1431) by searching for the global minimum of Eq. (l44l) for all possible unknown 
positions of the particle. The result of this calculation is a fairly complicated non-linear set of equations for the i-prob to observe 
a click. 

The final step is to observe that by transformation Eq. (l45l) . this non-linear set of equations and the Pauli equation are equiva¬ 
lent. The latter, being a set of linear equations, are much easier to solve than their non-linear equivalent. 

In the logical inference approach, the assumption that each time we probe the object, the detection system reports a two-valued 
“color” index and our requirement that in the extreme case mentioned earlier we expect to see the motion of a classical magnetic 
moment automatically leads to the notion of a “quantized” (i.e. two-valued) intrinsic magnetic moment. The notion of spin 
appears as an inference, forced upon us by the (two-valued) data and our assumptions (which do not make reference to concepts 
of quantum theory) that the experiment is robust, etc. 

From a more general perspective, it is remarkable that the logic inference approach introduces the concept of “spin” in a way 
which is not much different from the way real numbers are introduced. Indeed, the latter appear as a necessity to provide an 
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answer to questions such as “what new kind of number do we have to introduce such that the square of it yields the integer 
If n = nr where in is an integer, no new concept has to be introduced but if say n = 2, the answer to the question is given the 
symbolic name \/2. 

Similarly, in our logical-inference treatment the concept of spin naturally appears as a result of describing situations in which 
there is two-valued data and the requirement that in a limiting case we recover the classical equation of motion. This concept of 
spin only exists in our mind, in complete agreement with the fact that this concept maybe put to very good use whenever there 
are two-valued variables that may or may not relate to (intrinsic) angular momentum, as in the theory of the electronic properties 
of graphene, for example [ l8ll . 

It will not have escaped the reader that in the logical-inference derivation of the Pauli equation as well as in earlier work 
along this line |K.46| there are no postulates regarding “wavefunctions”, “observables”, “quantization rules’, no “quantum” 
measurements 1471] ,“Born’s rule”, etc. This is a direct consequency of the basic premise of this approach, namely that current 
scientific knowledge derives, through cognitive processes in the human brain, from the discrete events which are observed 
in laboratory experiments and from relations between those events that we, humans, discover. These discrete events are not 
“generated” according to certain quantum laws: instead these laws appear as the result of (the best) inference based on available 
data in the form of discrete events. In essence, for all the basic but fundamental cases treated so far, the machinery of quantum 
theory appears as a result of transforming a set of non-linear equations into a set on linear ones. The wavefunction, spinor, 
spin, ... are all mathematical concepts, vehicles that render a class of complicated nonlinear minimization problems into the 
minimization of a quadratic forms. As products of our collective imagination, these concepts are extraordinarily useful but have 
no tangible existence, just like numbers themselves. Of course, it remains to be seen whether the logical-inference approach can 
be extended to e.g. many-body and relativistic quantum physics. 

In summary: the Pauli equation derives from logical inference applied to robust experiments in which there is uncertainty 
about individual detection events which yield information about the particle position and its two-valued “color”. This derivation 
adds another, new instance to the list of examples fl,[46!] for which the logical-inference approach establishes a bridge between 
objective knowledge gathered through experiments and their description in terms of concepts. 
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Appendix A: The algebra of logical inference 

This appendix does not contain any original material but is provided to render the present paper self-contained. 

If we are only concerned about quantifying the truth of a proposition given the truth of another proposition, it is possible to 
construct a mathematical framework, an extension of Boolean logic, that allows us to reason in a manner which is unambiguous 
and independent of the individual, in particular if there are elements of uncertainty in the description TO- 

The algebra of logical inference can be derived M from three so-called “desiderata”. The formulation which follows is 
taken from Ref. (ij. 

Desideratum 1. Plausibilities are represented by real numbers. The plausibility that a proposition A is true conditional on 
proposition B being true will be denoted by P[A\B). 

Desideratum 2. Plausibilities must exhibit agreement with rationality. As more and more evidence supporting the truth of 
a proposition becomes available, the plausibility should increase monotonically and continuously and the plausibility of the 
negation of the proposition should decrease monotonically and continuously. Moreover, in the limiting case that proposition A 
is known to be either true or false, the plausibility P(A\B) should conform to the rules of deductive reasoning. In other words, 
plausibilities must be in qualitative agreement with the patterns of plausible reasoning uncovered by Polya 1148!] . 

Desideratum 3. All rules relating plausibilities must be consistent. Consistency of rational reasoning demands that if the rules 
of logical inference allow a plausibility to be obtained in more than one way, the result should not depend on the particular 
sequence of operations. These three desiderata only describe the essential features of the plausibilities and definitely do not 
constitute a set of axioms which plausibilities have to satisfy. 

It is a most remarkable fact that these three desiderata suffice to uniquely determine the set of rules by which plausibilities 
may be manipulated M. Omitting the derivation, it follows that plausibilities may be chosen to take numerical values in the 
range [0,1] and obey the rules i'3-S 

1. P(A\Z) + P(A |Z) = 1 where A denotes the negation of proposition A and Z is a proposition assumed to be true. 

2. P{AB\Z) = P(A\BZ)P(B\Z) = P(B\AZ)P(A\Z) where the “product” BZ denotes the logical product (conjunction) of the 
propositions B and Z, that is the proposition BZ is true if both B and Z are true. This rule will be referred to as “product 
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rule”. It should be mentioned here that it is not allowed to define a plausibility for a proposition conditional on the 
conjunction of mutual exclusive propositions. Reasoning on the basis of two or more contradictory premises is out of the 
scope of the present paper. 

3. P(AA\Z) = 0 and P(A+A\Z) = 1 where the “sum” A + B denotes the logical sum (inclusive disjunction) of the propositions 
A and B, that is the proposition A + B is true if either ,4 or B or both are true. These two rules show that Boolean algebra 
is contained in the algebra of plausibilities. 

The algebra of logical inference, as defined by the rules (1-3), is the foundation for powerful tools such as the maximum 
entropy method and Bayesian analysis M. The rules (1-3) are unique M. Any other rule which applies to plausibilities 
represented by real numbers and is in conflict with rules (1-3) will be at odds with rational reasoning and consistency, as 
embodied by the desiderata 1-3. 

The rules (1-3) are identical to the rules by which we manipulate probabilities I5[ l49l - i5lll . However, the rules (1-3) were not 
postulated. They were derived from general considerations about rational reasoning and consistency only. Moreover, concepts 
such as sample spaces, probability measures etc., which are an essential part of the mathematical foundation of probability 
theory 1501 Dill , play no role in the derivation of rules (1-3). Perhaps most important in the context of quantum theory is that in 
the logical inference approach uncertainty about an event does not imply that this event can be represented by a random variable 
as defined in probability theory & 

There is a significant conceptual difference between “mathematical probabilities” and plausibilities. Mathematical probabili¬ 
ties are elements of an axiomatic framework which complies with the algebra of logical inference. Plausibilities are elements of 
a language which also complies with the algebra of logical inference and serve to facilitate communication, in an unambiguous 
and consistent manner, about phenomena in which there is uncertainty. 

The plausibility P(A\B) is an intermediate mental construct that serves to carry out inductive logic, that is rational reasoning, in 
a mathematically well-defined manner JU]. In general, P(A\B) may express the degree of believe of an individual that proposition 
A is true, given that proposition B is true. However, in the present paper, we explicitly exclude applications of this kind because 
they do not comply with our main goal, namely to describe phenomena “in a manner independent of individual subjective 
judgment”. 

To take away this subjective connotation of the word “plausibility”, we will simply call P(A\B) the “inference- 
probability” or “i-prob” for short. 


A comment on the notation used throughout this paper is in order. To simplify the presentation, we make no distinction 
between an event such as “detector D has fired” and the corresponding proposition “D = detector D has fired”. If we have 
two detectors, say D x where x = ±1, we write P{x\Z) to denote the i-prob of the proposition that detector D x fires, given that 
proposition Z is true. Similarly, the i-prob of the proposition that two detectors D x and D y fire, given that proposition Z is true, 
is denoted by P{x,y\Z). Obviously, this notation generalizes to more than two propositions. 


Appendix B: Pauli equation: quantum theory 


In this appendix, we show that the quadratic form, the minimization of which yields the Pauli equation, is identical to the one 
derived in Sec tionlHl through logical inference. 

The Pauli equation for a particle with mass m and charge q can be written as 


dt 


where 


d> = <b(x,f) 


3*i Cm) \ 

3*2 (M) ) 


is a two-component wavefunction and the Hamiltonian is given by 

H = 2m { CT ‘ H^ V ~4 a (M)]} 2 + #(M) 

= [-//iV -qA(x,t)} 2 +q<p(x,t) - • B(x,f), 


(Bl) 


(B2) 


(B3) 


where <7 = (cr v , a y . a z ) T denote the Pauli-spin matrices. 
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By the standard variational argument, it follows that the Pauli equation is an extremum of the quadratic form (functional) 


«-/ 


= / dxdt 


—O —O^^-J + — ^ + i7rVO T —qA(x,f) < I > ' )( — iftVO — qA(x,t)<t> 

O 1 ^ ■ B(x,f)<J) 

2m 


<90\ 1 


with respect to variations in <f>'. We want to know how Eq. (IB4b looks like when we substitute the polar representation 


for both components of the spinor. We have 

d® k {x,t) 


^ ■ s /P 2 (x,t)e iS ^l h J 1 


dt 


1 dP k (x,t) i dS k (x,t) 


for k = 1,2 and 


dt ^ 


2P k (x,t) dt ft dt 

1 dP k (x,t) , i dS k (x,t ) 




from which it directly follows that 

ih / <90' 


' |_2/\(x,D <9f ft <9f 


flt, 


Likewise we have 


, O O 
2 V dt dt 


dSi(x,t) n f ^ , dS 2 (x,t) 


dt 


-Pi(x,t) + - 


dt 


Piittd)- 


- ih ( VO*. - jA(x,t)® k ) = —ih 


and 


2 in 


VO 


.t m'I, 


7^77TT VP ^ x ’ f ) + i vs *(*-0 - T A ( x ’ f ) 
2P k (x,t) ft ft 


A(x,f)<t>' ) ( VO- —A(x,t)0 ) = 


\M(x,f) 


js k (x,t)/n 


t.2 2 


Furthermore, it follows that 


+ ? [VS i (x,r)-«A(x,,)]-jft(x,0. 
0(x,f)O t O = 0(x,f)[P 1 (x,f)+P 2 (x,f)], 


O' a'O = 2\/ Pi(x,t)P 2 .(x,t)cos 


S 2 (x,t) — S\(x,t) 


O t (J y O = 2 ^ P l (x,t)P 2 (x,t) sin S ^ x '^ ^i( x ^) ) 

n 

O t C7'0 = Pi(x,f)-P 2 (x,f). 

Thus, we have all the expressions to write Eq. (lB4l > in terms of P\ (x,t), P 2 (x,t), Si (x,f), and S 2 (x,f). 
Without loss of generality and without assigning a particular meaning to the new symbols yet, we write 

Pi(x,t) =P(x,t)cos 2 - r - 
P 2 (x,t) =P(x,t)sirr^-^!l. 


Then we have 

2 


(VfMx,0) 2 = 7 


4P*(x,f) 


1 


0(x,f) 


4 j Pi(x,f) 
1 


VP(x,f)cos 2 —P(x,t )cos " v *'‘' sin " ' ~’‘ y VO(x.t) 


0(x,t) . d(x,t), 
— sin — 


+ 


P 2 (x 1 t) 


VP(x,t) sin 


2 B(x,t) 


. , 0(x,t) 0(x,f)_„. 

+ P(x, f) cos —-—- sin —-—- V0 (x, f 


2 


(B4) 

(B5) 

(B6) 

(B7) 

(B8) 

(B9) 


(BIO) 

(Bll) 

(B12) 

(B13) 

(B14) 


(B15) 


= ? {?77jl vp < x -')] 2+i> < x -') [V0(*,O] 2 } ■ 

(B16) 

</>(x,f)O t O = </>(x,f)P(x,D, 

(B17) 

0 + cr v 0 = P(x,t )sin 0(x,/)cos , 

ft 

(B18) 

+ „ , , x S 2 (x,t)-S<(x,t) 

O’ <7 y O = P(x,t) sin 0(x,f) sin---, 

h 

(B19) 

O^crO = P(x,f)cos 0(x,f)- 

(B20) 


Similarly, 
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Next, we introduce 


and obtain 


Si(x,t)=S(x,t)~ 
S 2 (x,t) =S(x,t) + 


h(p(x,t) 

2 

Ti(p(x,t) 


(B21) 


ih ( 




E 

k=l L 


2 V dt 


VS k (x,t)-qA(x,t) 


+ <9<E>\ dS(x,t) r . , h d(p(x,t) , . 

^ [Bi(x,f)+P 2 (x,t)]-- ^ [Bi(x,0--P 2 (x,0] 

P(x,f), 


dt 2 K ' dt 


(B22) 


P k (x,t)=P{x,t)j^ 


1 — (—l) A COS0(x,f) 


*=1 


V5(x,f) -qA(x,t) + V(p(x,f) 


= | [V5(x,f) — gA(x,f)] 2 + E [V(j?(x,t)] 2 | P(x,f) 

—Ticos 0(x,f)V<p(x,f) [VS(x,f) — ^A(x,f)]P(x,f) 
$ 1 cr v <I> = P (x, t ) sin 0 (x, t) cos <p (x, t ), 

<J>’ <7- v <I> = P(x,f)sin0(x,f)sinp(x,f), 

<f>’ < 7 '<J) = P(x,f)cOS 0(x,f). 


(B23) 

(B24) 

(B25) 

(B26) 


Collecting all terms we find 

s2 


Q = 


J dxd '(L{ 


P(x,t) 


[VP(x,f)] 2 +[V0(x,f)] 2 B(x,O + 


^£)_» cose(x ,,)^£) 

dt 2 v ' dt 


P(x,t) 


+- l — | [VS(x,f) - </A(x,f)]~ + E [V<p(x,f)] 2 — Ticos 0(x,f)V<p(x,f) [VS(x,f) -<?A(x,f)] |p(x,f) 


+q(j)(x,t)P(x,t) —— [B v sin0(x,f)cos(p(x,f) + B v sin0(x,f)sin<p(x,f) +B z cos9(x,t)]P(x,t) ). 

2m 11 


(B27) 


Appendix C: Classical mechanics of a magnetic moment 

For completeness, we collect some well-known facts about the classical mechanical description of the rotational motion 
of a magnetic moment M = M(f) = (M x ,M y ,M z ) T which does not move and interacts with a magnetic field B = B(f) = 
C B x ,B y ,B z ) T 0. The motion of the magnetic moment is completely determined by the torque equation 


dM 

dt 


= yM x B, 


where y is the gyromagnetic ratio. In terms of components we have 

dM x 

= y(M y B z — M z By) 


dM y 

dt 

dM z 

dt 


= y{M z B x -M x B z ) 
= y(M x B y — M y B x ). 


Assuming that the total magnetic moment Mq = (M 2 +M~ +Mr) 1 ' 2 is constant, we can write M = Mom where 


(Cl) 


(C2) 


m = (cos (p sin 0, sin (p sin 0, cos 0) r , 

is a unit vector and (0, <p) are its spherical coordinates. The equations of motion of these coordinates read 

dtp d arct<m(m y / m x ) 


dt 


dt 


(C3) 
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where z = cos 6 and 


If we define a Hamiltonian Hm by 


it follows that 


m% + mj, 


drily 


dm x 

dt 


m- 


r (m x B x + m y B y ) 


B z I —7J-y 

m- + mt; 

z 

—B z H— -j== (By cos <p + B y sin (p) 
V1 -z 2 


= 7 

= 7 

dz 

— = Y(m x B y - m y B x ) 
dt 

= 7 \J 1 — z 2 (—Bysincp+Bycoscp) . 

Hm = —7 (B^sinS cos <p+,B v sin 0 sin <p+ZL cos 0) 


= -7 


zB- + \/1 — z 2 {B x cos (p + By sin <p) 


dtp dH\i 

dt dz 


c/z 

dt 


dH, 


M 


dtp 


(C4) 


(C5) 


(C6) 


(C7) 


From Eq. ( 1C7I >. it follows that the pair f<p,z) are conjugate variables. 

Finally, it is easy to check that the equations of motion Eq. (IC7b follow by searching for the extremum of the functional 


I* h d i 


+ Hm • 


(C8) 
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